Ontology-Based Automatic Text Summarization

Authors

  • Krzysztof J. Kochut
  • John A. Miller
  • I. Budak Arpinar
  • John Miller
  • Gurinder Gosal
Abstract

by Meghana Viswanath (under the direction of Krzysztof J. Kochut)

This thesis presents an ontology-based approach to automatic extractive text summarization. Most extractive summarization systems to date have used statistical measures to determine the importance of sentences. We instead use a knowledge-based approach that draws on ontological knowledge, with the Wikipedia ontology as its source, to determine sentence importance. The input document is first mapped onto the ontology, and a sub-graph of the ontology is then extracted. This sub-graph, called the thematic graph, contains the ontology concepts that match terms in the document, together with the ontology edges that represent relationships between those concepts. The thematic graph therefore represents the theme of the input document. It is then analyzed using various graph-based importance measures to determine the relative importance of its nodes, and these values are ultimately used to decide which sentences are included in the document's summary.
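The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the thesis implementation: the tiny edge list `TOY_ONTOLOGY_EDGES` stands in for the Wikipedia ontology, concepts are matched to document terms by exact lowercase word match, and node degree is used as one possible graph-based importance measure (the abstract does not say which measures the thesis uses). All function names are hypothetical.

```python
import re

# Hypothetical toy ontology: undirected edges between concept labels.
# The thesis uses the Wikipedia ontology; this list is only a stand-in.
TOY_ONTOLOGY_EDGES = [
    ("ontology", "knowledge"),
    ("ontology", "graph"),
    ("graph", "node"),
    ("graph", "edge"),
    ("summary", "sentence"),
    ("weather", "rain"),
]


def tokenize(text):
    """Lowercase the text and return its alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def build_thematic_graph(document, ontology_edges):
    """Return the sub-graph of the ontology induced by the document's terms.

    Nodes are ontology concepts that occur in the document; an edge is kept
    only when both of its endpoint concepts occur in the document.
    """
    terms = set(tokenize(document))
    concepts = {c for edge in ontology_edges for c in edge}
    graph = {c: set() for c in concepts if c in terms}
    for a, b in ontology_edges:
        if a in graph and b in graph:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def degree_importance(graph):
    """Assumed importance measure: the number of neighbours of each node."""
    return {node: len(neighbours) for node, neighbours in graph.items()}


def summarize(document, ontology_edges, max_sentences=1):
    """Pick the highest-scoring sentences, returned in document order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    importance = degree_importance(build_thematic_graph(document, ontology_edges))
    # A sentence's score is the summed importance of the distinct concepts it mentions.
    scores = [sum(importance.get(t, 0) for t in set(tokenize(s))) for s in sentences]
    ranked = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    return [sentences[i] for i in sorted(ranked[:max_sentences])]


if __name__ == "__main__":
    doc = (
        "An ontology is a graph of knowledge. "
        "Each node in the graph has an edge. "
        "It may rain today."
    )
    print(summarize(doc, TOY_ONTOLOGY_EDGES, max_sentences=2))
```

Other graph-based measures could be swapped in for `degree_importance` without changing the rest of the sketch, since sentence scoring only needs a mapping from concept to importance value.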

Related resources

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to sift through such a massive amount of data in order to extract the useful information is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

Systematic literature review of fuzzy logic based text summarization

"Information Overload" is not a new term, but with the massive development in technology, which enables anytime, anywhere, easy and unlimited access, participation in and publishing of information have consequently escalated its impact. Assisting users' informational searches with reduced reading and surfing time by extracting and evaluating accurate, authentic and relevant information are the primary c...

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...

Ontology-Based Automatic Text Summarization Using FarsNet

To summarize a text means to compress the text source into a shorter text in a way that the informational content is kept the same. With regard to the irregular volume of information available on the internet, manual summarization of huge volume of information by humans will be very arduous and difficult. There have been many activities in the field of automatic summarization so far. However, a...

Constructing an Ontology for WWW Summarization in Bone Marrow Transplantation (BMT)

We describe an ontology for WWW summarization in Bone Marrow Transplantation that is currently under construction. It is text-based and qualifies as a grounded ontology. In addition, it is user-centered. For stating medical knowledge, we use first-order logic extended with contexts. The ontology is prepared to serve query scenario formulation, text passage retrieval and summarization proper in ...

Automatic Summarization Based on Automatically-Induced Ontology

In this paper, we proposed an ontology-based method for summarizing documents. Automatic summarization based on ontology is considered better than other methods. However, existing methods require the ontology to be manually constructed and maintained, which is subjective and time-consuming, so the ontology-based method is not used often. In addition, existing summarization methods consider only...

Publication date: 2009